The Statistical Inefficiency of Sparse Coding for Images (or, One Gabor to Rule them All)
نویسندگان
چکیده
Sparse coding is a proven principle for learning compact representations of images. However, sparse coding by itself often leads to very redundant dictionaries. With images, this often takes the form of similar edge detectors which are replicated many times at various positions, scales and orientations. An immediate consequence of this observation is that the estimation of the dictionary components is not statistically efficient. We propose a factored model in which factors of variation (e.g. position, scale and orientation) are untangled from the underlying Gabor-like filters. There is so much redundancy in sparse codes for natural images that our model requires only a single dictionary element (a Gabor-like edge detector) to outperform standard sparse coding. Our model scales naturally to arbitrary-sized images while achieving much greater statistical efficiency during learning. We validate this claim with a number of experiments showing, in part, superior compression of out-of-sample data using a sparse coding dictionary learned with only a single image.
منابع مشابه
Face Recognition using an Affine Sparse Coding approach
Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...
متن کاملTexture Classification of Diffused Liver Diseases Using Wavelet Transforms
Introduction: A major problem facing the patients with chronic liver diseases is the diagnostic procedure. The conventional diagnostic method depends mainly on needle biopsy which is an invasive method. There are some approaches to develop a reliable noninvasive method of evaluating histological changes in sonograms. The main characteristic used to distinguish between the normal...
متن کاملStatistical Principles in Image Modeling
Images of natural scenes contain a rich variety of visual patterns. To learn and recognize these patterns from natural images, it is necessary to construct statistical models for these patterns. In this review article we describe three statistical principles for modeling image patterns: the sparse coding principle, the minimax entropy principle, and the meaningful alignment principle. We explai...
متن کاملExtensions of Ica as Models of Natural Images and Visual Processing
Using statistical models one can estimate features from natural images, such as images that we see in everyday life. Such models can also be used in computional visual neuroscience by relating the estimated features to the response properties of neurons in the brain. A seminal model for natural images was linear sparse coding which, in fact, turned out to be equivalent to ICA. In these linear g...
متن کاملA Novel Image Denoising Method Based on Incoherent Dictionary Learning and Domain Adaptation Technique
In this paper, a new method for image denoising based on incoherent dictionary learning and domain transfer technique is proposed. The idea of using sparse representation concept is one of the most interesting areas for researchers. The goal of sparse coding is to approximately model the input data as a weighted linear combination of a small number of basis vectors. Two characteristics should b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1109.6638 شماره
صفحات -
تاریخ انتشار 2011